Mining spatial association rules in census data

نویسندگان

  • Donato Malerba
  • Floriana Esposito
  • Francesca A. Lisi
  • Annalisa Appice
چکیده

In this paper we propose a method for the discovery of spatial association rules, that is, association rules involving spatial relations among (spatial) objects. The method is based on a multi-relational data mining approach and takes advantage of the representation and reasoning techniques developed in the field of inductive logic programming (ILP). In particular, the expressive power of predicate logic is profitably used to represent spatial relations and background knowledge (such as spatial hierarchies and rules for spatial qualitative reasoning) in a very elegant, natural way. The integration of computational logics with efficient spatial database indexing and querying procedures permits applications that cannot be tackled by traditional statistical techniques in spatial data analysis. The proposed method has been implemented in the ILP system SPADA (spatial pattern discovery algorithm). We report the preliminary results of the application of SPADA to Stockport census data. 1. Background and motivation Censuses make a huge variety of general statistical information on society available to both researchers and the general public. Population and economic census information is of great value in planning public services (education, funds allocation, public transportation), as well as in private businesses (locating new factories, shopping malls or banks, as well as marketing particular products). The application of data mining techniques to census data, and more generally, to official data, has great potential in supporting good public policy and in underpinning the effective functioning of a democratic society [29]. Nevertheless, it is not straightforward and requires challenging methodological research, which is still in the initial stages. As an illustrative example of some research issues, let us consider the census data table reported in Figure 1, where each row represents an enumeration district (ED), the smallest areal unit for which census data are published in UK (). () National statistics institutes (NSIs) make a great effort to collect census data, but they are not the only organisations that analyse them: data analysis is often done by different inst itutes. By law, NSIs are prohibited from releasing individual responses to any other government agency or to any individual or business enterprise, so data are summarised for reasons of privacy before being distributed to external agencies and D. Malerba, F. Esposito, F. A. Lisi and A. Appice  Mining spatial association rules in census data 20 The data analyst might be interested in finding some kind of dependence between the active population and the percentage of cars per household. A dependence can be expressed as an association rule, that is an implication of the form

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovery of spatial association rules in geo-referenced census data: A relational mining approach

Census data mining has great potential both in business development and in good public policy, but still must be solved in this field a number of research issues. In this paper, problems related to the geo-referenciation of census data are considered. In particular, the accommodation of the spatial dimension in census data mining is investigated for the task of discovering spatial association r...

متن کامل

Mining interesting spatial association rules: two case studies

In spatial data mining, a common task is the discovery of spatial association rules from spatial databases. We propose a distributed system named ARES that assists data miners in the complex process of extracting the association rules from a spatial database. We also face a common problem of association rule mining, that is the high number of discovered rules. This affects both efficiency of th...

متن کامل

Mining and Filtering Multi-level Spatial Association Rules with ARES

In spatial data mining, a common task is the discovery of spatial association rules from spatial databases. We propose a distributed system, named ARES that takes advantage of the use of a multi-relational approach to mine spatial association rules. It supports spatial database coupling and discovery of multi-level spatial association rules as a means for spatial data exploration. We also prese...

متن کامل

Exploring the Relationships between Spatial and Demographic Parameters and Urban Water Consumption in Esfahan Using Association Rule Mining

In recent years, Iran has faced serious water scarcity and excessive use of water resources. Therefore, exploring the pattern of urban water consumption and the relationships between geographic and demographic parameters and water usage is an important requirement for effective management of water resources. In this study, association rule mining has been used to analyze the data of municipal w...

متن کامل

Introducing an algorithm for use to hide sensitive association rules through perturb technique

Due to the rapid growth of data mining technology, obtaining private data on users through this technology becomes easier. Association Rules Mining is one of the data mining techniques to extract useful patterns in the form of association rules. One of the main problems in applying this technique on databases is the disclosure of sensitive data by endangering security and privacy. Hiding the as...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001